Efficient Parallel Solutions of Large Sparse Spd Systems on Distributed-memory Multiprocessors

نویسنده

  • CHUNGUANG SUN
چکیده

We consider several issues involved in the solution of sparse symmetric positive deenite systems by multifrontal method on distributed-memory multiprocessors. First, we present a new algorithm for computing the partial factorization of a frontal matrix on a subset of processors which signiicantly improves the performance of a distributed multifrontal algorithm previously designed. Second, new parallel algorithms for computing sparse forward elimination and sparse backward substitution are described. The new algorithms solve the sparse triangular systems in a multifrontal fashion. Numerical experiments run on an Intel iPSC/860 and an Intel iPSC/2 for a set of problems with regular and irregular sparsity structure are reported. More than 180 million ops per second during the numerical factorization are achieved for a three-dimensional grid problem on an iPSC/860 machine with 32 processors. 1. Introduction. The numerical solution of a large sparse symmetric positive deenite system Ax = b lies at the heart of many scientiic and engineering computations. One way of solving such a system is rst to compute the sparse Cholesky factorization PAP T = LL T , where L is a lower triangular matrix and P is a permutation matrix for reducing the number of nonzeros in L. The solution vector is computed by solving the two sparse triangular systems Lu = Pb and L T v = u, and setting x = P T v. Typically, a direct solution process involves four steps: (i) computation of a ll-reducing ordering, (ii) symbolic factorization, (iii) numerical factorization, and (iv) triangular solution. George and Liu 13] provide a comprehensive discussion on these four steps and their implementations on serial machines. We focus our attention on the design of eecient parallel algorithms for the numerical factorization and the triangular solution on distributed-memory multiprocessors. We assume that the symbolic processing steps including ordering and symbolic factorization are done sequentially. Signiicant eeort has been invested on designing eecient parallel algorithms for solving sparse symmetric positive deenite systems on distributed-memory multiprocessors. Heath, Ng and Peyton 17] have recently published a survey on parallel sparse matrix factorization algorithms. There are roughly three classes of algorithms for computing sparse Cholesky factorization on distributed-memory machines|fan-out algorithm 12], fan-in algorithm 2] and multifrontal algorithm 8]. Ashcraft, Eisenstat, Liu and Sherman 3] have reported that the multifrontal method is more eecient than both fan-out and fan-in algorithms on an iPSC/2 machine for the 63 63 nine-point regular grid. Pothen and Sun 25] have …

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Parallel Solution of Sparse Linear Least Squares Problems on Distributed-Memory Multiprocessors

This paper studies the solution of large-scale sparse linear least squares problems on distributed-memory multiprocessors. The method of corrected semi-normal equations is considered. New block-oriented parallel algorithms are developed for solving the related sparse triangular systems. The arithmetic and communication complexities of the new algorithms applied to regular grid problems are anal...

متن کامل

Parallel Solution of Sparse Linear Least Squares Problemson Distributed - Memory

This paper studies the solution of large-scale sparse linear least squares problems on distributed-memory multiprocessors. The method of corrected semi-normal equations is considered. New block-oriented parallel algorithms are developed for solving the related sparse triangular systems. The arithmetic and communication complexities of the new algorithms applied to regular grid problems are anal...

متن کامل

Experience with Fine-Grain Communication in EM-X Multiprocessor for Parallel Sparse Matrix Computation

Sparse matrix problems require a communication paradigm different from those used in conventional distributed-memory multiprocessors. We present in this paper how fine-grain communication can help obtain high performance in the experimental distributed-memory multiprocessor, EM-X, developed at ETL, which can handle fine-grain communication very efficiently. The sparse matrix kernel, Conjugate G...

متن کامل

Parallelization of Multilevel ILU Preconditioners on Distributed-Memory Multiprocessors

In this paper, we investigate the parallelization of the ILUPACK library for the solution of sparse linear systems on distributed-memory multiprocessors. Preliminary experimental results on a cluster of Intel QuadCore processors report remarkable speed-ups for our approach.

متن کامل

Iterative and Direct Sparse Solvers on Parallel Computers

Solving large sparse systems of linear equations is required for a wide range of numerical applications. This paper addresses the main issues raised during the parallelization of iterative and direct solvers for such systems in distributed memory multiprocessors. If no preconditioning is considered, iterative solvers are simple to parallelize, as the most time-consuming computational structures...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2007